System and method for predicting maximum cooler and rack capacities in a data center

ABSTRACT

A system and method for evaluating equipment in a data center, the equipment including a plurality of equipment racks, and at least one cooling provider. In one aspect, a method includes receiving data regarding each of the plurality of equipment racks and the at least one cooling provider, the data including a layout of the equipment racks and the at least one cooling provider, a power draw value for each of the equipment racks, and a maximum cooler capacity value for the at least one cooling provider; storing the received data; determining cooling performance of at least one of the plurality of equipment racks based on the layout; determining a cooling load for the at least one cooler and a difference between the cooling load and the maximum cooler capacity value; for each equipment rack, determining a maximum rack capacity based on the layout and the difference between the cooling load and the maximum cooler capacity value; and displaying an indication of the maximum rack capacity for each equipment rack.

BACKGROUND

1. Field of the Invention

At least one embodiment in accordance with the present invention relates generally to systems and methods for data center management and design, and more specifically, to systems and methods for predicting maximum cooler and rack capacities in a data center.

2. Discussion of Related Art

In response to the increasing demands of information-based economies, information technology networks continue to proliferate across the globe. One manifestation of this growth is the centralized network data center. A centralized network data center typically consists of various information technology equipment, collocated in a structure that provides network connectivity, electrical power and cooling capacity. Often the equipment is housed in specialized enclosures termed “racks” which integrate these connectivity, power and cooling elements. In some data center configurations, these rows are organized into hot and cold aisles to decrease the cost associated with cooling the information technology equipment. These characteristics make data centers a cost effective way to deliver the computing power required by many software applications.

Various processes and software applications, such as the InfrastruXure® Central product available from American Power Conversion Corporation of West Kingston, R.I., have been developed to aid data center personnel in designing and maintaining efficient and effective data center configurations. These tools often guide data center personnel through activities such as designing the data center structure, positioning equipment within the data center prior to installation and repositioning equipment after construction and installation are complete. Thus, conventional tool sets provide data center personnel with a standardized and predictable design methodology.

SUMMARY OF THE INVENTION

A first aspect of the invention is directed to a computer-implemented method for evaluating equipment in a data center, the equipment including a plurality of equipment racks, and at least one cooling provider. The method includes receiving data regarding each of the plurality of equipment racks and the at least one cooling provider, the data including a layout of the equipment racks and the at least one cooling provider, and a power draw value for each of the equipment racks, storing the received data, determining maximum cooler capacity for the at least one cooling provider based on the layout and the power draw, for each equipment rack, determining a maximum rack capacity based on the layout and the maximum cooler capacity, and displaying an indication of the maximum rack capacity for at least one equipment rack.

The at least one cooling provider may be a plurality of cooling providers, and the method may further include determining a maximum cooler capacity value for each cooling provider based on a maximum air return temperature to the plurality of cooling providers. In the method, determining the maximum rack capacity for each equipment rack may include determining the maximum rack capacity based on available space in each equipment rack and based on available power of each equipment rack. The method may further include determining cooling performance of each equipment rack based on air flows in the data center. The method may also include determining a cooling load for the at least one cooling provider based on an ambient temperature determined based on a difference between total cooling load in the data center and total power load in the data center. In the method, determining a cooling load for the at least one cooling provider may include determining the cooler load based on a cooler return temperature determined based on the ambient temperature, and displaying an indication of the maximum rack capacity may include displaying a model of the data center with the indication of the maximum rack capacity for an equipment rack displayed along with a model of the equipment rack.

Another aspect of the invention is directed to a system for evaluating equipment in a data center, the equipment including a plurality of equipment racks, and at least one cooling provider. The system includes an interface, and a controller coupled to the interface and configured to receive data regarding each of the plurality of equipment racks and the at least one cooling provider, the data including a layout of the equipment racks and the at least one cooling provider, and a power draw value for each of the equipment racks, store the received data in a storage device associated with the system, determine a maximum cooler capacity value for the at least one cooling provider based on the layout and the power draw, and for each equipment rack, determine a maximum rack capacity based on the layout and the maximum cooler capacity value.

In the system, the at least one cooling provider may be a plurality of cooling providers, and the controller may be further configured to determine a maximum cooler capacity value for each of the plurality of cooling providers. The controller may be configured to determine the maximum rack capacity based on available space in each equipment rack and based on available power of each equipment rack. The controller may be further configured to determine cooling performance of each equipment rack based on air flows in the data center, and to determine cooler load for the at least one cooling provider based on an ambient temperature determined based on a difference between total cooling load in the data center and total power load in the data center. The controller may also be configured to determine the cooling load based on a cooler return temperature determined based on the ambient temperature. The system may further include a display coupled to the controller, and the controller may be configured to display an indication of the maximum rack capacity.

Another aspect of the invention is directed to a computer readable medium having stored thereon sequences of instruction including instructions that will cause a processor to receive data regarding each of a plurality of equipment racks and at least one cooling provider, the data including a layout of the equipment racks and the at least one cooling provider, and a power draw value for each of the equipment racks, store the received data in a storage device, determine a maximum cooler capacity value for the at least one cooling provider based on the layout and the power draw; and for each equipment rack, determine a maximum rack capacity based on the layout and the maximum cooler capacity value. The at least one cooling provider may be a plurality of cooling providers, and the sequences of instructions may include instructions that will cause the processor to determine a maximum cooler capacity value for each cooling provider, and determine the maximum rack capacity based on available space in each equipment rack and based on available power of each equipment rack. The sequences of instructions may include instructions that will cause the processor to determine cooling performance of each equipment rack based on air flows in the data center. The sequences of instructions may include instructions that will cause the processor to determine a cooling load for the at least one cooling provider based on an ambient temperature determined based on a difference between total cooling load in the data center and total power load in the data center. The sequences of instructions may also include instructions that will cause the processor to determine the cooler load based on a cooler return temperature determined based on the ambient temperature.

BRIEF DESCRIPTION OF DRAWINGS

The accompanying drawings are not intended to be drawn to scale. In the drawings, each identical or nearly identical component that is illustrated in various figures is represented by a like numeral. For purposes of clarity, not every component may be labeled in every drawing. In the drawings:

FIG. 1 shows an example computer system with which various aspects in accord with the present invention may be implemented;

FIG. 2 illustrates an example distributed system including an embodiment;

FIG. 3 shows an example of a display screen in accordance with one embodiment of the invention;

FIG. 4 shows a flowchart of a first process in accordance with one embodiment of the invention;

FIG. 5 shows a flowchart of a second process in accordance with one embodiment of the invention;

FIG. 6 shows a flowchart of a third process in accordance with one embodiment of the invention;

FIG. 7 shows a flowchart of a fourth process in accordance with one embodiment of the invention;

FIG. 8 shows a flowchart of a fifth process in accordance with one embodiment of the invention;

FIG. 9 shows a flowchart of a sixth process in accordance with one embodiment of the invention; and

FIG. 10 shows a flowchart of a seventh process in accordance with one embodiment of the invention.

DETAILED DESCRIPTION

At least some embodiments in accordance with the present invention relate to systems and processes through which a user may design and analyze data center configurations. These systems and processes may facilitate this design and analysis activity by allowing the user to create models of data center configurations from which performance metrics may be determined. Both the systems and the user may employ these performance metrics to determine alternative data center configurations that meet various design objectives.

As described in U.S. patent application Ser. No. 12/019,109, titled “System and Method for Evaluating Equipment Rack Cooling”, filed Jan. 24, 2008 (referred to herein as “the '109 Application”), and in U.S. patent application Ser. No. 11/342,300, titled “Methods and Systems for Managing Facility Power and Cooling” filed Jan. 27, 2006 (referred to herein as “the '300 application”), both of which are assigned to the assignee of the present application, and both of which are hereby incorporated herein by reference in their entirety, typical equipment racks in modern data centers draw cooling air in the front of the rack and exhaust air out the rear of the rack. The equipment racks, and in-row coolers are typically arranged in rows in an alternating front/back arrangement creating alternating hot and cool aisles in a data center with the front of each row of racks facing the cool aisle and the rear of each row of racks facing the hot aisle. Adjacent rows of equipment racks separated by a cool aisle may be referred to as a cool aisle cluster, and adjacent rows of equipment racks separated by a hot aisle may be referred to as a hot aisle cluster. Further, single rows of equipment may also be considered to form both a cold and a hot aisle cluster by themselves. As readily apparent to one of ordinary skill in the art, a row of equipment racks may be part of multiple hot aisle clusters and multiple cool aisle clusters. In descriptions and claims herein, equipment in racks, or the racks themselves, may be referred to as cooling consumers, and in-row cooling units and/or computer room air conditioners (CRACs) may be referred to as cooling providers. In the referenced applications, tools are provided for analyzing the cooling performance of a cluster of racks in a data center. In these tools, multiple analyses may be performed on different layouts to attempt to optimize the cooling performance of the data center.

In at least one embodiment, a method is provided for performing in real-time an analysis on a layout of equipment in a data center, for determining the maximum capacity of coolers in the layout, and based on the maximum capacity of the coolers, and other considerations discussed below, providing the maximum electrical load for equipment racks co-located with the coolers. The method may be incorporated in a system or tool having capabilities for predicting the cooling performance of clusters and for performing other design and analysis functions of equipment in a data center.

The aspects disclosed herein in accordance with the present invention, are not limited in their application to the details of construction and the arrangement of components set forth in the following description or illustrated in the drawings. These aspects are capable of assuming other embodiments and of being practiced or of being carried out in various ways. Examples of specific implementations are provided herein for illustrative purposes only and are not intended to be limiting. In particular, acts, elements and features discussed in connection with any one or more embodiments are not intended to be excluded from a similar role in any other embodiments.

For example, according to one embodiment of the present invention, a computer system is configured to perform any of the functions described herein, including but not limited to, configuring, modeling and presenting information regarding specific data center configurations. Further, computer systems in embodiments may be used to automatically measure environmental parameters in a data center, and control equipment, such as chillers or coolers to optimize performance. Moreover, the systems described herein may be configured to include or exclude any of the functions discussed herein. Thus the invention is not limited to a specific function or set of functions. Also, the phraseology and terminology used herein is for the purpose of description and should not be regarded as limiting. The use herein of “including,” “comprising,” “having,” “containing,” “involving,” and variations thereof is meant to encompass the items listed thereafter and equivalents thereof as well as additional items.

Computer System

Various aspects and functions described herein in accordance with the present invention may be implemented as hardware or software on one or more computer systems. There are many examples of computer systems currently in use. These examples include, among others, network appliances, personal computers, workstations, mainframes, networked clients, servers, media servers, application servers, database servers and web servers. Other examples of computer systems may include mobile computing devices, such as cellular phones and personal digital assistants, and network equipment, such as load balancers, routers and switches. Further, aspects in accordance with the present invention may be located on a single computer system or may be distributed among a plurality of computer systems connected to one or more communications networks.

For example, various aspects and functions may be distributed among one or more computer systems configured to provide a service to one or more client computers, or to perform an overall task as part of a distributed system. Additionally, aspects may be performed on a client-server or multi-tier system that includes components distributed among one or more server systems that perform various functions. Thus, the invention is not limited to executing on any particular system or group of systems. Further, aspects may be implemented in software, hardware or firmware, or any combination thereof. Thus, aspects in accordance with the present invention may be implemented within methods, acts, systems, system elements and components using a variety of hardware and software configurations, and the invention is not limited to any particular distributed architecture, network, or communication protocol.

FIG. 1 shows a block diagram of a distributed computer system 100, in which various aspects and functions in accord with the present invention may be practiced. Distributed computer system 100 may include one more computer systems. For example, as illustrated, distributed computer system 100 includes computer systems 102, 104 and 106. As shown, computer systems 102, 104 and 106 are interconnected by, and may exchange data through, communication network 108. Network 108 may include any communication network through which computer systems may exchange data. To exchange data using network 108, computer systems 102, 104 and 106 and network 108 may use various methods, protocols and standards, including, among others, token ring, ethernet, wireless ethernet, Bluetooth, TCP/IP, UDP, Http, FTP, SNMP, SMS, MMS, SS7, Json, Soap, and Corba. To ensure data transfer is secure, computer systems 102, 104 and 106 may transmit data via network 108 using a variety of security measures including TSL, SSL or VPN among other security techniques. While distributed computer system 100 illustrates three networked computer systems, distributed computer system 100 may include any number of computer systems and computing devices, networked using any medium and communication protocol.

Various aspects and functions in accordance with the present invention may be implemented as specialized hardware or software executing in one or more computer systems including computer system 102 shown in FIG. 1. As depicted, computer system 102 includes processor 110, memory 112, bus 114, interface 116 and storage 118. Processor 110 may perform a series of instructions that result in manipulated data. Processor 110 may be a commercially available processor such as an Intel Pentium, Motorola PowerPC, SGI MIPS, Sun UltraSPARC, or Hewlett-Packard PA-RISC processor, but may be any type of processor or controller as many other processors and controllers are available. Processor 110 is connected to other system elements, including one or more memory devices 112, by bus 114.

Memory 112 may be used for storing programs and data during operation of computer system 102. Thus, memory 112 may be a relatively high performance, volatile, random access memory such as a dynamic random access memory (DRAM) or static memory (SRAM). However, memory 112 may include any device for storing data, such as a disk drive or other non-volatile storage device. Various embodiments in accordance with the present invention may organize memory 112 into particularized and, in some cases, unique structures to perform the aspects and functions disclosed herein.

Components of computer system 102 may be coupled by an interconnection element such as bus 114. Bus 114 may include one or more physical busses, for example, busses between components that are integrated within a same machine, but may include any communication coupling between system elements including specialized or standard computing bus technologies such as IDE, SCSI, PCI and InfiniBand. Thus, bus 114 enables communications, for example, data and instructions, to be exchanged between system components of computer system 102.

Computer system 102 also includes one or more interface devices 116 such as input devices, output devices and combination input/output devices. Interface devices may receive input or provide output. More particularly, output devices may render information for external presentation. Input devices may accept information from external sources. Examples of interface devices include keyboards, mouse devices, trackballs, microphones, touch screens, printing devices, display screens, speakers, network interface cards, etc. Interface devices allow computer system 102 to exchange information and communicate with external entities, such as users and other systems.

Storage system 118 may include a computer readable and writeable nonvolatile storage medium in which instructions are stored that define a program to be executed by the processor. Storage system 118 also may include information that is recorded, on or in, the medium, and this information may be processed by the program. More specifically, the information may be stored in one or more data structures specifically configured to conserve storage space or increase data exchange performance. The instructions may be persistently stored as encoded signals, and the instructions may cause a processor to perform any of the functions described herein. The medium may, for example, be optical disk, magnetic disk or flash memory, among others. In operation, the processor or some other controller may cause data to be read from the nonvolatile recording medium into another memory, such as memory 112, that allows for faster access to the information by the processor than does the storage medium included in storage system 118. The memory may be located in storage system 118 or in memory 112, however, processor 110 may manipulate the data within the memory 112, and then copies the data to the medium associated with storage system 118 after processing is completed. A variety of components may manage data movement between the medium and integrated circuit memory element and the invention is not limited thereto. Further, the invention is not limited to a particular memory system or storage system.

Although computer system 102 is shown by way of example as one type of computer system upon which various aspects and functions in accordance with the present invention may be practiced, aspects of the invention are not limited to being implemented on the computer system as shown in FIG. 1. Various aspects and functions in accord with the present invention may be practiced on one or more computers having a different architectures or components than that shown in FIG. 1. For instance, computer system 102 may include specially-programmed, special-purpose hardware, such as for example, an application-specific integrated circuit (ASIC) tailored to perform a particular operation disclosed herein. While another embodiment may perform the same function using several general-purpose computing devices running MAC OS System X with Motorola PowerPC processors and several specialized computing devices running proprietary hardware and operating systems.

Computer system 102 may be a computer system including an operating system that manages at least a portion of the hardware elements included in computer system 102. Usually, a processor or controller, such as processor 110, executes an operating system which may be, for example, a Windows-based operating system, such as, Windows NT, Windows 2000 (Windows ME), Windows XP or Windows Vista operating systems, available from the Microsoft Corporation, a MAC OS System X operating system available from Apple Computer, one of many Linux-based operating system distributions, for example, the Enterprise Linux operating system available from Red Hat Inc., a Solaris operating system available from Sun Microsystems, or a UNIX operating system available from various sources. Many other operating systems may be used, and embodiments are not limited to any particular implementation.

The processor and operating system together define a computer platform for which application programs in high-level programming languages may be written. These component applications may be executable, intermediate, for example, C−, bytecode or interpreted code which communicates over a communication network, for example, the Internet, using a communication protocol, for example, TCP/IP. Similarly, aspects in accord with the present invention may be implemented using an object-oriented programming language, such as .Net, SmallTalk, Java, C++, Ada, or C♯ (C-Sharp). Other object-oriented programming languages may also be used. Alternatively, functional, scripting, or logical programming languages may be used.

Additionally, various aspects and functions in accordance with the present invention may be implemented in a non-programmed environment, for example, documents created in HTML, XML or other format that, when viewed in a window of a browser program, render aspects of a graphical-user interface or perform other functions. Further, various embodiments in accord with the present invention may be implemented as programmed or non-programmed elements, or any combination thereof. For example, a web page may be implemented using HTML while a data object called from within the web page may be written in C++. Thus, the invention is not limited to a specific programming language and any suitable programming language could also be used. Further, in at least one embodiment, the tool may be implemented using VBA Excel.

A computer system included within an embodiment may perform additional functions outside the scope of the invention. For instance, aspects of the system may be implemented using an existing commercial product, such as, for example, Database Management Systems such as SQL Server available from Microsoft of Seattle Wash., Oracle Database from Oracle of Redwood Shores, Calif., and MySQL from MySQL AB of Uppsala, Sweden or integration software such as Web Sphere middleware from IBM of Armonk, N.Y. However, a computer system running, for example, SQL Server may be able to support both aspects in accord with the present invention and databases for sundry applications not within the scope of the invention.

Example System Architecture

FIG. 2 presents a context diagram including physical and logical elements of distributed system 200. As shown, distributed system 200 is specially configured in accordance with the present invention. The system structure and content recited with regard to FIG. 2 is for exemplary purposes only and is not intended to limit the invention to the specific structure shown in FIG. 2. As will be apparent to one of ordinary skill in the art, many variant system structures can be architected without deviating from the scope of the present invention. The particular arrangement presented in FIG. 2 was chosen to promote clarity.

Information may flow between the elements, components and subsystems depicted in FIG. 2 using any technique. Such techniques include, for example, passing the information over the network via TCP/IP, passing the information between modules in memory and passing the information by writing to a file, database, or some other non-volatile storage device. Other techniques and protocols may be used without departing from the scope of the invention.

Referring to FIG. 2, system 200 includes user 202, interface 204, data center design and management system 206, communications network 208 and data center database 210. System 200 may allow user 202, such as a data center architect or other data center personnel, to interact with interface 204 to create or modify a model of one or more data center configurations. According to one embodiment, interface 204 may include aspects of the floor editor and the rack editor as disclosed in Patent Cooperation Treaty Application No. PCT/US08/63675, entitled METHODS AND SYSTEMS FOR MANAGING FACILITY POWER AND COOLING, filed on May 15, 2008, which is incorporated herein by reference in its entirety and is hereinafter referred to as PCT/US08/63675. In other embodiments, interface 204 may be implemented with specialized facilities that enable user 202 to design, in a drag and drop fashion, a model that includes a representation of the physical layout of a data center or any subset thereof. This layout may include representations of data center structural components as well as data center equipment. The features of interface 204, as may be found in various embodiments in accordance with the present invention, are discussed further below. In at least one embodiment, information regarding a data center is entered into system 200 through the interface, and assessments and recommendations for the data center are provided to the user. Further, in at least one embodiment, optimization processes may be performed to optimize cooling performance and energy usage of the data center.

As shown in FIG. 2, data center design and management system 206 presents data design interface 204 to user 202. According to one embodiment, data center design and management system 206 may include the data center design and management system as disclosed in PCT/US08/63675. In this embodiment, design interface 204 may incorporate functionality of the input module, the display module and the builder module included in PCT/US08/63675 and may use the database module to store and retrieve data.

As illustrated, data center design and management system 206 may exchange information with data center database 210 via network 208. This information may include any information required to support the features and functions of data center design and management system 206. For example, in one embodiment, data center database 210 may include at least some portion of the data stored in the data center equipment database described in PCT/US08/63675. In another embodiment, this information may include any information required to support interface 204, such as, among other data, the physical layout of one or more data center model configurations, the production and distribution characteristics of the cooling providers included in the model configurations, the consumption characteristics of the cooling consumers in the model configurations, and a listing of equipment racks and cooling providers to be included in a cluster.

In one embodiment, data center database 210 may store types of cooling providers, the amount of cool air provided by each type of cooling provider, and a temperature of cool air provided by the cooling provider. Thus, for example, data center database 210 includes records of a particular type of CRAC unit that is rated to deliver airflow at the rate of 5,600 cfm at a temperature of 68 degrees Fahrenheit. In addition, the data center database 210 may store one or more cooling metrics, such as inlet and outlet temperatures of the CRACs and inlet and outlet temperatures of one or more equipment racks. The temperatures may be periodically measured and input into the system, or in other embodiments, the temperatures may be continuously monitored using devices coupled to the system 200.

Data center database 210 may take the form of any logical construction capable of storing information on a computer readable medium including, among other structures, flat files, indexed files, hierarchical databases, relational databases or object oriented databases. The data may be modeled using unique and foreign key relationships and indexes. The unique and foreign key relationships and indexes may be established between the various fields and tables to ensure both data integrity and data interchange performance.

The computer systems shown in FIG. 2, which include data center design and management system 206, network 208 and data center equipment database 210, each may include one or more computer systems. As discussed above with regard to FIG. 1, computer systems may have one or more processors or controllers, memory and interface devices. The particular configuration of system 200 depicted in FIG. 2 is used for illustration purposes only and embodiments of the invention may be practiced in other contexts. Thus, embodiments of the invention are not limited to a specific number of users or systems.

In at least one embodiment, which will now be described, a tool is provided for determining the maximum cooler capacity of a cooler as installed in a data center, for determining the maximum rack load that can be placed at a rack position in the data center, based at least in part on the maximum cooler capacity, and for determining cooler return temperature for one or more coolers in the data center. Once determined, the current cooling load relative to the maximum cooler capacity may be displayed for each cooler on a representation of the cooler on a layout of the data center using for example a bar chart. FIG. 3 shows a model of a hot aisle cluster 300 of equipment racks 302 and in-row coolers 304 in a data center that may be displayed using a system in accordance with one embodiment. The displayed model of the cluster includes a bar graph 306 on each rack that provides an indication of the maximum rack load for each rack and an indication of what part or percentage of that maximum rack load is presently being utilized. In one embodiment, the maximum rack load may be determined based on the lesser of the maximum power available to the rack and the maximum cooling capacity of the rack. Similarly, the displayed model of the cluster includes a bar graph 308 on each cooler that provides an indication of the maximum cooler capacity of each cooler and the current cooling load for each cooler.

While FIG. 3 uses bar graphs to provide indications of capacity and current loading, in other embodiments, other graphical and numerical representations may be used. Systems, tools and methods for determining the capacities and loading shown in FIG. 3 in embodiments of the invention will now be described.

The current cooling load for an installed cooler can be monitored (directly or indirectly) in real time, and a fixed nominal capacity for a cooler (e.g. 17 kW) can be determined for a cooler from manufacturing specifications. The installed maximum cooler capacity is more complex and in at least one embodiment described herein is determined based on the building cooling infrastructure (e.g. water/glycol flow rates and temperatures for chilled water units) and the return air temperature to the cooler. As the maximum cooler capacity increases with return temperature, the maximum cooler capacity for a given installation will be achieved at maximum return air temperature. Thus in one embodiment, to estimate the maximum cooler capacity, the details of the building cooling infrastructure are used along with a practical maximum return air temperature that is estimated based on the data center environment (location, airflow, and power of IT equipment, geometric details, etc.) in the neighborhood of the cooler. In one embodiment, in generating this estimate, it is assumed that any remaining space in equipment racks in the neighborhood of a cooler will be filled with IT equipment of the same type and maximum power as presently exists on average. Procedures used to compute the maximum cooler capacity are described in further detail below.

In at least one embodiment, procedures for computing the maximum power load that may be installed in any given rack account for the fact that the load in one rack affects the cooling performance of other racks in the neighborhood. In computing the maximum equipment that can be installed in any one rack, the power in that rack is iteratively adjusted (increased) just to the point of a predicted cooling failure anywhere in the neighborhood. As rack power is increased, at some point the maximum cooler capacity may be exceeded, indicating that the cooler will no longer be able to meet the specified target supply temperature (i.e. it cannot cool the air all the way down to the set point), and in some embodiments, this indicates that maximum rack power has been reached. In other embodiments, some increase in supply temperature (particularly for just one cooler) may be acceptable, but as the supply temperature approaches the maximum desired rack inlet temperatures (e.g. 77° F.), cooling performance will typically become unacceptable. In at least one embodiment, maximum rack power calculations are performed as if all other racks remain at their current power level and, as soon as additional load is added to any one rack, the maximum rack capacity at other nearby rack locations will generally be reduced.

In one embodiment, before determining the maximum rack power for each rack, the maximum cooler capacity for each cooler in the cluster is determined using process 400 shown in FIG. 4 below. In general, the capacity of a cooler increases with an increase in return temperature to the cooler, and in process 400, the power loading of each rack in the cluster is increased up to a maximum power level limit for the cluster at which specified cooling criteria for the cluster are still met. At the maximum power level limit, a maximum return temperature for the cluster will be realized, and based on the maximum return temperature, the maximum power level for each cooler, as laid out in the cluster, can be determined.

In a first stage 402 of the process 400, information regarding a data center, including details on a particular cluster to be analyzed, is entered into the system. The information includes identifying information for cooling units in the cluster. The information may be entered manually by a user, provided to the system electronically, or may have previously been entered and stored in the system. In one embodiment, at least some of the information may be sensed by the system using sensors installed in the data center and/or through direct communication with the cooler over a communications network. In one embodiment, the information entered includes, for each cooler, the type of cooler (e.g. model number), the coolant type, the entering coolant temperature to the cooler, the current cooler water flow rate, and the valve position (from 0 to 100%) of the valve which determines how much coolant is distributed directly through the cooling coil and how much is diverted as bypass flow.

At stage 404 a determination is made as to whether there is available space and power in the racks of the cluster. If the outcome of stage 404 is NO, then process 400 ends at stage 412. In a situation where there is no additional power and space available in any of the racks, then each of the racks is operating at its maximum capacity. If the outcome of stage 404 is YES, then at stage 406, the power value for each of the racks having available power and space is increased by an incremental amount. In different embodiments of the invention, different schemes may be used for raising the power value for the racks. In one embodiment, each of the racks may be raised by a same amount (limited by total power and space availability), while in other embodiments, each may be raised by a similar percentage.

At stage 408, a cooling analysis is performed on the cluster using a cooling calculator. For the additional power added to the equipment racks being evaluated, in one embodiment, the additional cooling air flow (expressed in cfm/kW) is based on the average air flow requirements for the existing equipment in the equipment rack. In typical equipment racks used in data centers, a value of 160 cfm/kw can be used to determine required air flow based on power draw of the equipment. The cooling calculator used at stage 408 can be one of the calculators described in the '109 and '300 patent applications discussed above. In one embodiment, the cooling calculator uses the algebraic calculator for determining capture index (CI) discussed in the '109 application. The cold-aisle capture index for a rack is defined in at least some embodiments as the fraction of air ingested by the rack which originates from local cooling resources (e.g., perforated floor tiles or local coolers). The hot-aisle capture index is defined as the fraction of air exhausted by a rack which is captured by local extracts (e.g., local coolers or return vents). CI therefore varies between 0 and 100% with better cooling performance generally indicated by greater CI values. In a cold-aisle analysis, high CI's ensure that the bulk of the air ingested by a rack comes from local cooling resources rather than being drawn from the room environment or from air which may have already been heated by electronics equipment. In this case, rack inlet temperatures will closely track the perforated-tile airflow temperatures and, assuming these temperatures are within the desired range, acceptable cooling will be achieved. In a hot-aisle analysis, high CI's ensure that rack exhaust is captured locally and there is little heating of the surrounding room environment.

While good (high) CI values typically imply good cooling performance; low CI values do not necessarily imply unacceptable cooling performance. For example, in a rack in a raised-floor environment which draws most of its airflow from the surrounding room environment rather than from the perforated tiles, the rack's cold-aisle CI will be low; however, if the surrounding room environment is sufficiently cool, the rack's inlet temperature may still be acceptable. In this case, the rack's cooling needs are met by the external room environment rather than perforated tiles within the rack's cluster. If this process is repeated many times across the data center, facility cooling will be complex and may be unpredictable. High CI values lead to inherently scalable cluster layouts and more predictable room environments. In one embodiment of the present invention, the cooling performance of the cluster is considered satisfactory if the CI for all racks in the cluster is greater than 90% although this threshold typically increases as cooler supply and surrounding ambient temperatures approach the maximum target rack inlet temperature. In other embodiments, other cooling calculators may be used including CFD calculators.

At stage 410, a determination is made based on the results of the cooling analysis as to whether the cooling performance of the cluster is satisfactory. If the outcome of stage 410 is NO, then the process proceeds to stage 426 discussed below, where the maximum cooler capacity is set to the previous value. If the outcome of stage 410 is NO on the first iteration through process 400, then the there is no previous value computed for the maximum cooler capacity, and the maximum cooler capacities are determined using processes 500 and 600 discussed below. If the outcome of stage 410 is YES, then the process proceeds to stage 414, where the Load of each cooler in the cluster is determined. The Load for a cooler is the rate at which heat is removed by the cooler under calculated conditions. In one embodiment, the Load is determined using Equation (1) below: Load=ρ Q ^(air) c _(p) (T _(return) −T _(set point))   Equation (1)

Where: ρ is the density of air=1.19 kg/m³

-   -   Q^(air) is the cooler airflow rate (maximum rated airflow for         cooler type     -   c_(p) is the specific heat of air=1005 J/(kg° C.)     -   T_(return) is the cooler air return temperature determined above     -   T_(set point) is the cooler airflow supply temperature set point

At stage 416, the maximum cooler capacity for each cooler in the cluster is determined based on current operating conditions of the cluster. In one embodiment, the process used for determining maximum cooling capacity differs based on whether the particular cooler is a chilled water cooling unit or a direct expansion (DX) cooling unit. A process 500 for determining maximum cooling capacity for chilled water cooling units is discussed below with reference to FIG. 5, and a process 600 for determining maximum cooling capacity for DX cooling units is described below with reference to FIG. 6.

Next, at stage 418, a determination is made as to whether the Load for each cooler is less than or equal to the maximum cooler capacity Cap_(max). If the outcome of stage 418 is YES, then the process returns to stage 406, where the rack power for all racks is incrementally increased again. If the outcome of stage 418 is NO, then the process 400 proceeds to stage 420 where the cooler airflow supply temperature, which is subject to the Cap_(max) limit, is updated using Equation (2) below: T _(supply) =T _(return)−Cap_(max)/(ρ Q ^(air) c _(p))   Equation (2)

At stage 418, a “No” is achieved if cooling capacity is insufficient for any one or more coolers.

After the supply temperature is updated, the process proceeds to stage 422, where the cooling calculator is run again using the new value for supply temperature. At stage 424, a determination is made again as to whether the cooling results for the cluster are satisfactory. If the outcome of stage 424 is YES, then the process ends at stage 412 with the maximum cooler capacity for each of the coolers set equal to the value most recently determined at stage 416. If the outcome of stage 424 is NO, then the maximum cooler capacity for each of the coolers is set to the previous value at stage 426 and the process then ends at stage 412.

FIG. 5 provides a flowchart for a process 500 that may be used in a data center management system in accordance with one embodiment for computing maximum cooler capacity for chilled water cooling units in a data center, such as one of the coolers 306 of FIG. 3. In the exemplary embodiment, which will now be described, the process applies to InRow RC and RP chilled water units available from American Power Conversion Corporation by Schneider Electric, but as readily understood by one of ordinary skill in the art given the benefit of this disclosure, the process may be used with other chilled water units.

In a first stage 502 of the process 500, information regarding a data center, including details on a particular cluster to be analyzed, is entered into the system. This may be the same information entered in stage 402 of process 400 discussed above, and if entered in process 400, stage 502 may be skipped.

At stage 504 in the process, a determination is made as to whether the cluster is a proper cluster. A proper cluster is a grouping of two approximately-equal-length rows of equipment separated by a common cold or hot aisle; there are no gaps between equipment in the rows. At stage 506, for a proper cluster, the return temperature for each cooler in the cluster is calculated. In one embodiment, the return temperature for a proper cluster is determined as described in the '109 application referenced above, while in other embodiments, the return temperature is determined using the process described below. At stage 508, for improper clusters, in one embodiment, the return temperature is estimated using a process described further below. In other embodiments, for both proper and improper clusters, the return temperature may be determined using a full computational fluid dynamics (CFD) analysis, but such an analysis can not typically be performed in real time.

Once the return temperature has been determined at either stage 506 or 508, then at stage 510, the flow coefficient at current valve position (C_(v)) and the flow coefficient at maximum valve opening (C_(v) ^(max)) are determined. The flow coefficient C_(v) relates the coolant flow rate to the square root of the pressure drop across the bypass valve: Q=C_(v)√{square root over (ΔP)} and may be needed to compute the maximum coolant flow rate through the cooling coil of the cooling unit. In one embodiment, C_(v) and C_(v) ^(max) may be determined from data provided by the manufacturer of the cooler. The manufacturer's data for the cooling units may be stored in a system performing the process 500. At stage 512, the maximum cooler water flow rate as installed (Q_(max)) is determined. In one embodiment, Q_(max) is determined using Equation (3) below:

$\begin{matrix} {Q_{\max} = {{Min}\left( {{\frac{C_{v}^{\max}}{C_{v}}Q},{21\; G\; P\; M}} \right)}} & {{Equation}\mspace{14mu}(3)} \end{matrix}$

Where Q=the current coolant flow rate. In some embodiments, the cooling capacity is capped at particular flow rate (e.g. 21 GPM) which may be dictated by considerations such as the life cycle of the coil and piping systems.

In other embodiments, the maximum coolant flow rate may be known (e.g. directly reported by the cooling unit). In this case, there is no need for calculations involving flow coefficients.

In the last stage 514 of the process, the maximum cooler capacity Cap_(max) is determined using information provided by the cooler manufacturer's specifications based on the entering water temperature (EWT), the return temperature and Q_(max). In some embodiments, if a coolant other than water is used (for example, a glycol mixture), then a correction factor may be provided to the determined maximum cooler capacity.

The process 500 may be performed for coolers in a cluster individually, although return temperatures may be determined for all coolers in a cluster simultaneously using a cluster calculator as described below. In one embodiment, return temperature values are determined using maximum cooler airflow values.

A process 600 used for predicting maximum cooler capacity for direct expansion (DX) cooling units will now be described with reference to FIG. 6 which shows a flowchart of the process 600. In a first stage 602 of the process 600, information regarding a data center, including details on a particular cluster to be analyzed, is entered into the system. This may be the same information entered in stage 402 of process 400 discussed above, and if entered in process 400, stage 602 may be skipped. At stage 604 in the process, a determination is made as to whether the cluster is a proper cluster, and at either stage 606 (for a proper cluster) or at stage 608 (for an improper cluster), the cooler return temperature is determined for each cooler in the cluster using the same procedure as in process 500 discussed above. At stage 610 in the process, the maximum cooling capacity (Cap max) for each cooler is determined using the return temperature and manufacturer's specifications for the cooler.

After computing the maximum capacity of the coolers in the cluster, in one embodiment, the maximum rack capacity is determined using the process 700 shown in flowchart form in FIG. 7. Process 700 is similar in some aspects to process 400 discussed above, except that process 700 is preformed individually for each equipment rack in a cluster. In a first stage 702 of the process 700, information regarding a data center, including details on a particular cluster to be analyzed is entered in to a system configured to perform the process 700. As with processes 500 and 600, the information may be entered into the system in a number of different ways. The information may include cooler specific information entered and or determined in processes 400, 500 and 600 above. In at least one embodiment, the process 700 is performed using the same system used to determine Cap_(max) for the coolers in the cluster.

At stage 704 of the process, a determination is made based on the information entered regarding the cluster as to whether there is available space and available power in a first equipment rack being evaluated. If the outcome of stage 704 is NO, then the process ends. If the outcome of stage 704 is YES, then the process proceeds to stage 706, where the power value for the rack being evaluated is increased by an incremental amount. The incremental increase may in some embodiments be user selectable while in other embodiments, the incremental increase may be a default value set in the system. In other embodiments, the incremental increase may be determined, based at least in part, on the amount of available power in the equipment rack. In one embodiment, the incremental increase is a small fraction, e.g. 10% of the maximum additional load that may be added to the rack based on available electrical power capacity.

In other embodiments, other approaches can be used to establish the incremental rack-loading changes so that greater precision in determining maximum rack capacity can be achieved with fewer iterations. In one such embodiment in particular, the iterative process proceeds as follows. First, a lower bound on acceptable-cooling-performance rack load is determined. This is typically taken as the current rack load (at which cooling performance is known to be acceptable) as a starting point. Second, an upper bound on acceptable-cooling-performance rack load is determined. This is typically taken as the maximum additional load possible given electrical power capacity as a starting point. In subsequent iterations, rack-load operating points are tested which fall halfway between the previously tested points. In this manner, the iterative “search space” is continually halved leading to much faster convergence than is achieved with the simple fixed-load-increment approach of FIG. 7. As an example, consider a rack which is currently operating at 4 kW with known acceptable cooling performance. Cooling performance is tested at its maximum electrical power limit of 10 kW and found unacceptable. The next operating point tested is (4+10 kW)/2=7 kW and found to be acceptable. So, the maximum rack load is now known to be between 7 kW and 10 kW so that the next point tested is (7+10 kW)/2=8.5 kW. This process continues until the lower and upper limits on known acceptable cooling performance converge to within some practical tolerance. In one embodiment, process 400 discussed above may use a similar approach to the iterative process of increasing power for all of the racks.

At stage 708, a cooling analysis is performed on the cluster using a cooling calculator. For the additional power added to the equipment rack being evaluated, in one embodiment, the additional cooling air flow (expressed in cfm/kW) is based on the average air flow requirements for the existing equipment in the equipment rack. In typical equipment racks used in data centers, a value of 160 cfm/kw can be used to determine required air flow based on power draw of the equipment. The cooling calculator used at stage 708 can be one of the calculators described in the '109 and '300 patent applications discussed above. In one embodiment, the cooling calculator uses the algebraic calculator for determining capture index (CI) discussed in the '109 application. In one embodiment of the present invention, the cooling performance of the cluster is considered satisfactory if the CI for all racks in the cluster is greater than 90% although this threshold typically increases as cooler supply and surrounding ambient temperatures approach the maximum target rack inlet temperature. In other embodiments, other cooling calculators may be used including CFD calculators.

At stage 710, a determination is made based on the results of the cooling analysis as to whether the cooling performance of the cluster is satisfactory. If the outcome of stage 710 is NO, then the process proceeds to stage 712, where the process is completed as discussed below. If the outcome of stage 710 is YES, then the process proceeds to stage 714, where the Load of each cooler in the cluster is determined. The Load for a cooler is the rate at which heat is removed by the cooler under calculated conditions. In one embodiment, the Load is determined using Equation (4) below: Load=ρ Q ^(air) c _(p) (T _(return) −T _(set point))   Equation (4)

Where: ρ is the density of air=1.19 kg/m³

-   -   Q^(air) is the cooler airflow rate (maximum rated airflow for         cooler type     -   c_(p) is the specific heat of air=1005 J/(kg° C.)     -   T_(return) is the cooler air return temperature determined above     -   T_(set point) is the cooler airflow supply temperature set point

Next, at stage 716, a determination is made as to whether the Load for each cooler is less than or equal to the maximum cooler capacity Cap_(max). The maximum cooler capacity for each cooler is determined in one embodiment using process 400 above. If the outcome of stage 716 is YES, then the process returns to stage 706, where the rack power is incrementally increased again. If the outcome of stage 714 is NO, then the process 700 proceeds to stage 718 where the cooler airflow supply temperature, which is subject to the Cap_(max) limit, is updated using Equation (5) below: T _(supply) =T _(return)−Cap_(max)/(ρ Q ^(air) c _(p))   Equation (5)

At stage 716, a “No” is achieved if cooling capacity is insufficient for any one or more coolers.

After the supply temperature is updated, the process returns to stage 708, where the cooling calculator is run again using the new value for supply temperature.

The process 700 continues until the power is increased to the point where there is not sufficient cooling air for the cluster, and the outcome of stage 710 is NO, indicating that the rack power has been increased to a level that is equal to or greater than the maximum rack capacity, and the process proceeds to stage 712, where process 700 is completed. In one embodiment, when stage 712 is reached, the rack power level is decreased by one increment to provide the maximum rack power level. In other embodiments, if it is desired to provide a finer resolution to the determination of the maximum rack power level, then after stage 712, the rack power level can be decreased by one increment and the process 700 can be performed again using smaller increments.

Processes 500 and 600 discussed above use estimated values for the cooler return temperatures for improper clusters. A process for estimating cooler return temperatures in accordance with one embodiment of the invention will now be described. In the process, a single average cooler return temperature is determined rather than a cooler-by-cooler return temperature. A description of improper clusters is generally provided above. In one embodiment, when an improper cluster of equipment has a hot aisle width greater than six feet, then the cluster is evaluated as two separate improper clusters. Further, if there are partially aligned gaps in each row of an improper cluster, then the improper cluster is evaluated as two separate clusters separated by the gaps. The global average return temperature T^(c) _(ave) is determined using Equation (6) below: T ^(c) _(ave) =β T ^(R) _(ave)+(1−β)T _(amb)   Equation (6)

Where:

T^(c) _(ave)=global average cooler return temperature

T_(amb)=ambient temperature

β=the fraction of cooler airflow that comes directly from the racks (0≦β≦1)

T^(R) _(ave)=the average rack exhaust temperature

In one embodiment, β is determined using Equation (7) and T^(R) _(av) is determined using Equation (8) below:

$\begin{matrix} {\beta = {\frac{\sum\limits_{i = 1}^{n}{{CI}_{i}Q_{i}^{R}}}{\sum\limits_{i = 1}^{N}Q_{i}^{C}} =}} & {{Equation}\mspace{14mu}(7)} \\ {T_{ave}^{R} = \frac{\sum\limits_{i = 1}^{n}{Q_{i}^{R}T_{i}^{R}}}{\sum\limits_{i = 1}^{n}Q_{i}^{R}}} & {{Equation}\mspace{14mu}(8)} \end{matrix}$

Where:

-   -   CI_(i)=hot aisle capture index of rack i     -   n=the number of racks     -   N=the number of coolers     -   Q_(i) ^(c)=airflow rate of cooler i     -   Q_(i) ^(R)=airflow rate of rack i

In at least some embodiments, the CI's are unknown, and the CI's, or more directly, β is estimated. For a global air ratio (AR) of less than one, the coolers are free to capture as much rack airflow as the airflow physics allows. However, when AR is greater than one, the coolers draw in additional “make up air” from the ambient which mixes with the rack airflow reducing the return temperature. In extreme situations for a very large AR, the cooler return temperature is equal to the ambient temperature. Considering the above, in one embodiment, the average cooler return temperature for an improper cluster is estimated using Equation (9) below, with the value of β in Equation (9) being determined using Equation (10) and Table 1:

$\begin{matrix} {T_{ave}^{c} = {{\beta\; T_{ave}^{R}} + {\left( {1 - \beta} \right)T_{amb}}}} & {{Equation}\mspace{14mu}(9)} \\ {{A\; R} = {\frac{\sum\limits_{i = 1}^{N}Q_{i}^{C}}{\sum\limits_{i = 1}^{n}Q_{i}^{R}} = {{the}\mspace{14mu}{global}\mspace{14mu}{air}\mspace{14mu}{ratio}}}} & {{Equation}\mspace{14mu}(8)} \end{matrix}$

TABLE 1 Value of β to Use in Eqn. 9 Value of AR Open from Eqn. 10 Aisle HACS AR < 1   AR ≧ 1 $\frac{\begin{matrix} 0.80 \\ 0.80 \end{matrix}}{AR}$ $\frac{\begin{matrix} 0.95 \\ 0.95 \end{matrix}}{AR}$

As discussed above, for proper clusters, a number of different procedures can be used to determine the cooler return temperatures. An additional process for determining cooler return temperatures in accordance with one embodiment of the invention will now be described. The process may be particularly effective for coolers that draw a significant fraction of their return air directly from the ambient environment rather than directly from the racks.

Cooler return temperatures may be determined using Equation (11) below:

$\begin{matrix} {T_{j}^{C} = \frac{\sum\limits_{i = 1}^{n}{f_{ij}Q_{i}^{R}T_{i}^{R}}}{\sum\limits_{i = 1}^{n}{f_{ij}Q_{i}^{R}}}} & {{Equation}\mspace{14mu}(11)} \end{matrix}$

Where:

T_(j) ^(c)=the return temperature of cooler j

f_(ij)=the fraction of airflow from rack i that is captured by cooler j

Q_(i) ^(R)=the airflow rate of rack i

T_(i) ^(R)=the exhaust temperature of rack i

n=the total number of racks

After all cooler return temperatures for a cluster are computed by Equation 11, the effect of the ambient environment is accounted for by scaling all temperatures uniformly up or down until the overall average cooler return temperature is correct based on aggregate cluster rack airflow rates, exhaust temperatures, and CI's and cooler airflow rates.

In one embodiment of the present invention, a process for determining return temperature for coolers adds an additional term to Equation (11) to account for the amount of air drawn directly from the ambient environment by each cooler, and the resulting equation is shown below as Equation (12):

$\begin{matrix} {T_{j}^{C} = {\frac{\sum\limits_{i = 1}^{n}{f_{ij}Q_{i}^{R}T_{i}^{R}}}{\sum\limits_{i = 1}^{n}{f_{ij}Q_{i}^{R}}} + {\left( {1 - \frac{\sum\limits_{i = 1}^{n}{f_{ij}Q_{i}^{R}}}{Q_{j}^{C}}} \right)T_{amb}}}} & {{Equation}\mspace{14mu}(12)} \end{matrix}$

where Q_(j) ^(C) is the airflow rate of cooler j and T_(amb) is the ambient temperature of the surrounding room. Equation (12) provides the final cooler return temperatures. The f_(ij) values in Equation (12) are determined as shown below in Equations (13) and (14).

For Racks in Row A and Coolers in Row A

$\begin{matrix} {f_{ij} = \frac{\left( Q_{Aj} \right)_{capself}A\;{\mathbb{e}}^{{- B}\;\Delta\; x}}{\left( Q_{Ai} \right)_{supnet}}} & {{Equation}\mspace{14mu}(13)} \end{matrix}$

For Racks in Row A and Coolers in Row B

$\begin{matrix} {f_{ij} = \frac{{C\left( Q_{Bj} \right)}_{capself}A\;{\mathbb{e}}^{{- B}\;\Delta\; x}}{\left( Q_{Ai} \right)_{supnet}}} & {{Equation}\mspace{14mu}(14)} \end{matrix}$

where

-   (Q_(Aj))_(cap self)=The airflow captured by the cooler at location     A_(j) -   (Q_(Bj))_(cap self)=The airflow captured by the cooler at location     B_(j)

Δx=horizontal distance between locations (slots) i and j

A, B are empirical constants

C=empirical “coupling” constant accounting for effects from the opposite row

Constants in one embodiment are A=1, B=0.25 and C varies with aisle width as summarized in the table below.

Aisle Width (ft) 3 3.5 4 4.5 5 5.5 6 C 0.75 0.63 0.5 0.48 0.45 0.43 0.4

And

$\begin{matrix} {\left( Q_{Ai} \right)_{supnet} = {\left( Q_{Ai} \right)_{supself} + {\sum\limits_{{{all}\mspace{14mu} j} \neq i}{\left( Q_{Aj} \right)_{supself}A\;{\mathbb{e}}^{{- B}\;\Delta\; x}}} + {C\left\{ {\left( Q_{Bi} \right)_{supself} + {\sum\limits_{{{all}\mspace{14mu} j} \neq i}{\left( Q_{Bj} \right)_{supself}A\;{\mathbb{e}}^{{- B}\;\Delta\; x}}}} \right\}}}} & {{Equation}\mspace{14mu}(15)} \end{matrix}$

As readily understood by one of ordinary skill in the art given the benefit of this disclosure, calculations for racks in Row B follow analogous equations.

As discussed further above, CI values in embodiments of the present invention may be determined as discussed in the '109 and '300 applications, and in addition, CI values may be determined using Equation (16) below:

$\begin{matrix} {{CI}_{i} = {\sum\limits_{j = 1}^{N}f_{ij}}} & {{Equation}\mspace{14mu}(16)} \end{matrix}$

FIG. 8 shows a process 800 for determining CI values using Equation (16). In a first stage 802, the values for Q_(sup net) are determined for each of the racks using Equation (15). Next, at stage 804, values for f_(ij) are determined for all racks using Equations (13) and (14). At stage 806, the capture index values for all racks are determined using Equation (16). Next, at stage 808, a determination is made as to whether any of the CI values are greater than 1, which would not represent a valid solution. If the outcome of stage 808 is NO, then the process ends at stage 810. If the outcome of stage 808 is YES, then at stage 812, each f_(ij) value is scaled by the capture index, and stages 806 and 808 are repeated using the new f_(ij) values.

Embodiments described above for determining return temperatures may be used with a number of different cooling calculators. Further, the embodiment for determining airflows discussed above with reference to Equation (12) may be used with coolers other than in-row coolers including over-head or over the aisle coolers and traditional data center perimeter cooling units.

In at least some embodiments above, the ambient air temperature of a data center is used in calculators associated with evaluating cooling performance of a data center. Typically, the ambient air temperature of a data center, absent other data, is assumed to be 68 degrees Fahrenheit—which is also the typical cooler supply temperature. In one embodiment that will now be described, an ambient temperature correction process and tool, which may be utilized in systems and processes described above, adjusts the value for the ambient temperature in a data center until there is a balance between total heat from IT equipment in the data center and total cooling provided by all coolers.

In systems and processes described above, the load on coolers as a result of captured exhaust air from equipment racks is determined. Exhaust air that is not captured results in escaped power that heats the room and raises the return temperature of the coolers in the room. The escaped power heats a data center fairly uniformly so that the additional load due to the escaped power is distributed fairly uniformly over all coolers. In one embodiment, the ambient temperature correction process and tool operates in an iterative manner. First, the cooler return temperatures are estimated using an initial assumed ambient temperature (typically 68° F.). Then, the difference between total IT and cooler load in the room is determined using Equation (17): ΔP _(room) =P _(IT) −P _(coolers)   (17) where P_(IT) is the total IT equipment (rack) power and P_(coolers) the total initially-computed load on the coolers.

The correction to the ambient (T_(amb)) and cooler return (T_(r)) temperatures is then computed using Equation (18):

$\begin{matrix} {{\Delta\; T_{r}} = {{\Delta\; T_{amb}} = \frac{\Delta\; P_{room}}{\rho\; c_{p}Q_{c}}}} & (18) \end{matrix}$

where

Q_(c)=total cooler airflow rate

ρ=density of air

c_(p)=specific heat of air at constant pressure

A flowchart of a process 900 in accordance with one embodiment for ambient temperature adjustment is shown in FIG. 9. In stage 910 of the process, the total IT load is determined. Next, in stage 920, the total cooler airflow rate is determined, and in stage 930, the return temperature for the coolers is determined using one of the processes discussed above. In stage 940, the total cooler load is determined, using, for example, Equation 1 above. Next, in stage 950, the difference between the total IT and cooler load is determined using Equation (17), and at stage 960, a determination is made as to whether this difference is equal to zero. If the outcome of stage 960 is YES, then the process ends. If the outcome of stage 960 is NO, then the process proceeds to stage 970, where Equation (18) is used to determine the ambient temperature correction factor, and at stage 980, the return temperature is adjusted based on the outcome of stage 970. Stages 940, 950, 960, 970 and 980 are repeated until the outcome of stage 960 is YES, and then the process ends. In addition to ensuring a proper balance of cooling and IT load, process 900 computes the ambient temperature.

While examples with row-based cooling units were discussed above, processes for ambient temperature correction in accordance with embodiments of the invention may be extended to traditional room-based cooling units and other cooling architectures. In one embodiment, in order to estimate the return air temperature of the room coolers, the fraction of IT load that goes directly to the room-based coolers coolers is estimated using equation 19: ΣP _(CRACS) =γ ΔP _(room)   (19)

where,

ΣP_(CRACS)=the total cooling load on the room-based coolers and γ is the fraction of total cooling airflow in the room that comes from room-based coolers which, in the absence of other information, is estimated as follows:

$\begin{matrix} {\gamma = \frac{\sum Q_{CRACS}}{{\sum Q_{CRACS}} + {\sum Q_{IR}}}} & (20) \end{matrix}$

where,

ΣQ_(CRACS)=the total cooling airflow of Room coolers

ΣQ_(IR)=the total cooling airflow of InRow coolers

Further, it is assumed that all room-based coolers operate with the same return air temperature. The initial estimate of return air temperature for room-based coolers is calculated from:

$\begin{matrix} {T_{CRACS} = {T_{supply}^{c} + {3200\frac{\sum P_{CRACS}}{\sum Q_{CRACS}}}}} & (21) \end{matrix}$

where,

-   -   T_(CRACS)=the return air temperature of room-based coolers in °         F.     -   T_(supply) ^(c)=the cooler supply air temperature ° F.     -   ΣP_(CRACS)=the total cooling load on room-based coolers in kW         and ΣQ_(CRACS) is in cfm.

IT load which is not captured by InRow and Room coolers raises the room ambient temperature. This rise in room ambient temperature (ΔT_(amb)) over cooler supply temperature is calculated by satisfying the energy balance in the room:

$\begin{matrix} {{\Delta\; T_{amb}} = {3200\frac{\left( {1 - \gamma} \right)\Delta\; P}{{\sum Q_{CRACS}} + {\sum Q_{IR}}}}} & (22) \end{matrix}$

where,

ΔT_(amb) is the rise in room ambient temperature in ° F., ΔP is in kW, and airflow rates are in cfm.

In the end, the ΔT_(amb) is added to the originally-estimated room ambient temperature (T_(amb)) and cooler return air temperatures and the cooling load on coolers is updated, using Equation 4.

An embodiment of the invention will now be discussed with reference to FIG. 10 which uses a process 1000 for predicting return temperatures and performing the ambient temperature correction when a combination of row and room-based cooling units are present. At stage 1010 of FIG. 10, the return air temperature (Eqn 12) and cooling load (Eqn 1) of all row-based coolers are determined. At stage 1020, the total uncaptured or escaped power is estimated as the difference between the total IT (rack) load and the total cooler load (Eqn 17). At stage 1030, γ is computed (Eqn 20). At stage 1040 the return temperatures and cooling loads of all room-based coolers are calculated based on Equations 21 and 4, respectively. At stage 1050, the ambient temperature correction (which is numerically equal to the cooler return temperature correction) is computed from Equation 22. At stage 1060 the ambient temperature correction is added to the return temperature of all coolers and at stage 1070 the load on each cooler is updated based on Equations 1 or 4.

In methods of at least one embodiment of the invention, after successful modeling of a cluster in a data center, the results of the model may be used as part of a system to order equipment, ship equipment and install equipment in a data center as per the designed layout.

In at least some embodiments of the invention discussed herein, the performance of assessments and calculations in real-time refers to processes that are completed in a matter of a few seconds or less rather than several minutes or longer as can happen with complex calculations, such as those involving typical CFD calculations.

Having thus described several aspects of at least one embodiment of this invention, it is to be appreciated various alterations, modifications, and improvements will readily occur to those skilled in the art. Such alterations, modifications, and improvements are intended to be part of this disclosure, and are intended to be within the spirit and scope of the invention. Accordingly, the foregoing description and drawings are by way of example only. 

1. A computer-implemented method for evaluating equipment in a data center, the equipment including a plurality of equipment racks, and at least one cooling provider, the method comprising: receiving, by a computer, data regarding each of the plurality of equipment racks and the at least one cooling provider, the data including a layout of the equipment racks and the at least one cooling provider, and a power draw value for each of the equipment racks; storing, by the computer, the received data; determining a cooling load for the at least one cooling provider; determining, by the computer, maximum cooler capacity for the at least one cooling provider based on the layout, and the power draw; for each equipment rack, determining, by the computer, a maximum rack capacity based on the layout and the maximum cooler capacity, by iteratively increasing a value of power for the equipment rack and determining an updated cooling load for the at least one cooling provider until the updated cooling load exceeds the maximum cooler capacity; and displaying, by the computer, an indication of the maximum rack capacity for at least one equipment rack.
 2. The computer implemented method of claim 1, wherein the at least one cooling provider is one of a plurality of cooling providers, and wherein the method further includes determining a maximum cooler capacity value for each cooling provider based on a maximum air return temperature to the plurality of cooling providers.
 3. The computer implemented method of claim 1, wherein determining the maximum rack capacity for each equipment rack includes determining the maximum rack capacity based on available space in each equipment rack and based on available power of each equipment rack.
 4. The computer implemented method of claim 1, further comprising determining cooling performance of each equipment rack based on air flows in the data center.
 5. The computer implemented method of claim 1, wherein determining the cooling load for the at least one cooling provider includes determining the cooling load based on an ambient temperature determined based on a difference between total cooling load in the data center and total power load in the data center.
 6. The computer implemented method of claim 5, wherein determining the cooling load for the at least one cooling provider includes determining the cooling load based on a cooler return temperature determined based on the ambient temperature.
 7. The computer implemented method of claim 1, wherein displaying an indication of the maximum rack capacity includes displaying a model of the data center with the indication of the maximum rack capacity for an equipment rack displayed along with a model of the equipment rack.
 8. A system for evaluating equipment in a data center, the equipment including a plurality of equipment racks, and at least one cooling provider, the system comprising: an interface; and a controller coupled to the interface and configured to: receive data regarding each of the plurality of equipment racks and the at least one cooling provider, the data including a layout of the equipment racks and the at least one cooling provider, and a power draw value for each of the equipment racks; store the received data in a storage device associated with the system; determine a cooling load for the at least one cooling provider; determine a maximum cooler capacity value for the at least one cooling provider based on the layout and the power draw; and for each equipment rack, determine a maximum rack capacity based on the layout and the maximum cooler capacity value, by iteratively increasing a value of power for the equipment rack and determining an updated cooling load for the at least one cooling provider until the updated cooling load exceeds the maximum cooler capacity value.
 9. The system of claim 8, wherein the at least one cooling provider is one of a plurality of cooling providers, and wherein the controller is further configured to determine a maximum cooler capacity value for each of the plurality of cooling providers.
 10. The system of claim 8, wherein determine the maximum rack capacity for each equipment rack includes determine the maximum rack capacity based on available space in each equipment rack and based on available power of each equipment rack.
 11. The system of claim 8, wherein the controller is further configured to determine cooling performance of each equipment rack based on air flows in the data center.
 12. The system of claim 8, wherein determine a cooling load for the at least one cooling provider includes determine the cooling load for the at least one cooling provider based on an ambient temperature determined based on a difference between total cooling load in the data center and total power load in the data center.
 13. The system of claim 12, wherein determine the cooling load includes determine the cooling load based on a cooler return temperature determined based on the ambient temperature.
 14. The system of claim 8, wherein the system includes a display coupled to the controller, and wherein the controller is configured to display an indication of the maximum rack capacity.
 15. A computer readable non-transitory medium having stored thereon sequences of instructions including instructions that will cause a processor to: receive data regarding each of a plurality of equipment racks and at least one cooling provider, the data including a layout of the equipment racks and the at least one cooling provider, and a power draw value for each of the equipment racks; store the received data in a storage device; determine a cooling load for the at least one cooling provider; determine a maximum cooler capacity value for the at least one cooling provider based on the layout and the power draw; and for each equipment rack, determine a maximum rack capacity based on the layout and the maximum cooler capacity value, by iteratively increasing a value of power for the equipment rack and determining an updated cooling load for the at least one cooling provider until the updated cooling load exceeds the maximum cooler capacity value.
 16. The computer readable non-transitory medium of claim 15, wherein the at least one cooling provider is one of a plurality of cooling providers, and wherein the sequences of instructions include instructions that will cause the processor to determine a maximum cooler capacity value for each cooling provider.
 17. The computer readable non-transitory medium of claim 15, wherein determine the maximum rack capacity for each equipment rack includes determine the maximum rack capacity based on available space in each equipment rack and based on available power of each equipment rack.
 18. The computer readable non-transitory medium of claim 15, wherein the sequences of instructions include instructions that will cause the processor to determine cooling performance of each equipment rack based on air flows in the data center.
 19. The computer readable non-transitory medium of claim 15, wherein determine the cooling load for the at least one cooling provider includes determine the cooling load based on an ambient temperature determined based on a difference between total cooling load in the data center and total power load in the data center.
 20. The computer readable non-transitory medium of claim 19, wherein determine the cooling load for the at least one cooling provider includes determine the cooler load based on a cooler return temperature determined based on the ambient temperature. 